Voice quality and f0 cues for affect expression: implications for synthesis

نویسندگان

  • Irena Yanushevskaya
  • Christer Gobl
  • Ailbhe Ní Chasaide
چکیده

Synthesised stimuli were used to investigate how two notionally separable dimensions of tone-of-voice – voice quality and fundamental frequency – are involved in the expression of affect. Listeners were presented with three series of stimuli: (1) stimuli exemplifying different voice qualities, (2) stimuli all with modal voice quality but with different affect-related f0 contours, and (3) stimuli incorporating variation in both voice quality and affect-related f0 contours. A total of 15 stimuli were rated for 12 different affective attributes. Voice quality differentiation appears to account for the highest affect ratings overall, as indicated by the scores obtained for stimuli series (1) and (3). The relatively weaker affect signalling of stimuli differentiated by f0 alone corroborates findings in [2]. It also suggests that for the generation of expressive, affectively coloured speech synthesis, it is not sufficient to manipulate only f0; we also need to capture the voice quality dimension of the voice source.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mapping Voice to Affect: Japanese listeners

This paper reports the results of perception tests administered to speakers of Japanese as part of a cross-language investigation of how voice quality and f0 combine in the signalling of affect. Three types of synthesised stimuli were presented: (1) ‘VQ only’ involving variations in voice quality and a neutral f0; (2) ‘f0 only’, with different f0 contours and modal voice; and (3) combined ‘VQ +...

متن کامل

Production of English Lexical Stress by Persian EFL Learners

This study examines the phonetic properties of lexical stress in English produced by Persian speakers learning English as a foreign language. The four most reliable phonetic correlates of English lexical stress, namely fundamental frequency, duration, intensity, and vowel quality were measured across Persian speakers’ production of the stressed and unstressed syllables of five English disyllabi...

متن کامل

Intonation issues in HMM-based speech synthesis for Vietnamese

In an HMM-based Text-To-Speech system, contextual features, including phonetic and prosodic factors have a significant influence to the spectrum, F0 and duration of the synthetic voice. This paper proposes prosodic features aiming at improving the naturalness of an HMM-based TTS system (VTed) for a tonal language, Vietnamese. The ToBI (Tones and Break Indices) features are used to learn two cru...

متن کامل

Effect of Functional Endoscopic Sinus Surgery on the Voice Quality among Patients with Rhinosinus Polyposis

Introduction: Rhinosinus polyposis is associated with voice quality reduction. There has been little evidence about the efficacy of rhinosinus polyps surgery on patients' voice quality so far. The aim of the present study was to evaluate the nasality and acoustic voice changes after rhinosinus polyposis surgery.   Materials and Methods: The population in this study compo...

متن کامل

Voice Analysis in English and Persian Persuasive Texts: Pedagogical implications in focus

The main purpose of this study is to investigate how voice is realized by Iranian EFL learners in persuasive English and Persian text types. This discourse-related notion is a required criterion for writing acceptable English. However, L2 learners from cultures other than English might face problems in realizing it, or even ignore it all through their writing. In this connection, the present st...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005